Towards a Computational Semantic Analyzer for Urdu
نویسندگان
چکیده
This paper describes a first approach to a computational semantic analyzer for Urdu on the basis of the deep syntactic analysis done by the Urdu grammar ParGram. Apart from the semantic construction, external lexical resources such as an Urdu WordNet and a preliminary VerbNet style resource for Urdu are developed and connected to the semantic analyzer. These resources allow for a deeper level of representation by providing real-word knowledge such as hypernyms of lexical entities and information on thematic roles. We therefore contribute to the overall goal of providing more insights into the computationally efficient analysis of Urdu, in particular to computational semantic analysis.
منابع مشابه
A First Approach Towards an Urdu WordNet
This paper reports on a first experiment with developing a lexical knowledge resource for Urdu on the basis of Hindi WordNet. Due to the structural similarity of Urdu and Hindi, we can focus on overcoming the differences in the scriptual systems of the two languages by using transliterators. Various natural language processing tools, among them a computational semantics based on the Urdu ParGra...
متن کاملTowards Building Semantic Role Labeler for Indian Languages
We present a statistical system for identifying the semantic relationships or semantic roles for two major Indian Languages, Hindi and Urdu. Given an input sentence and a predicate/verb, the system first identifies the arguments pertaining to that verb and then classifies it into one of the semantic labels which can either be a DOER, THEME, LOCATIVE, CAUSE, PURPOSE etc. The system is based on 2...
متن کاملDiscovering Semantic Classes for Urdu N-V Complex Predicates
This paper reports on an exploratory investigation as to whether classes of Urdu N-V complex predicates can be identified on the basis syntactic patterns and lexical choices associated with the N-V complex predicates. Working with data from a POS annotated corpus, we show that choices with respect to the number of arguments, case marking on subjects and which light verbs are felicitous with whi...
متن کاملEncoding event structure in Urdu/Hindi VerbNet
We propose a new kind of event structure representation for computational linguistics, based on the theoretical framework of FirstPhase Syntax (Ramchand, 2008). We show that the approach not only gives a theoretically well-motivated set of subevents and related semantic roles, it also posits the levels of representation needed for analyzing a linguistic phenomenon that has repeatedly caused pro...
متن کاملA Computational Treatment of Differential Case Marking in Malayalam
Case is often treated as an uninteresting part of computational processing (both parsing and generation). In the mainly free word order South Asian languages, case plays a theoretically well established role in syntactic and semantic processing. Case is used not only to help identify grammatical relations (e.g., ergatives indicate subjects), but also contributes significantly to the semantic an...
متن کامل